Load balancing techniques for inter-domain traffic engineering

ABSTRACT

A method for balancing traffic across paths connecting a network to the Internet using a fractional allocation strategy for distributing the traffic from a congested selected path. The strategy includes: (a) associating the paths j with a counter i; (b) calculating the total initial selected path overload; (c) calculating the selected path load, wherein the load is equal to the initial selected path overload less the sum of the low capacity boundary for i path(s); (d) calculating the portion of the traffic on the selected path to be distributed using a bi-sectional search strategy; (e) distributing a portion of the traffic on the selected path to the other paths; and (f) stopping if there are no more paths (i=j), otherwise increasing the numerical value of the counter by one (1) and go to step (c).

BACKGROUND OF INVENTION

The present invention relates to a method for the management ofinter-domain traffic over the Internet. In particular, the presentinvention relates to methods for selecting the most efficient anduncongested paths for inter-domain traffic.

A major challenge for inter-domain traffic engineering is the level ofuncertainty an Internet Service Provider (ISP) is facing when selectingpaths for Internet traffic. With many potentially competing ISPs, globalcoordination of Internet traffic to select the best path and minimizecongestion can be difficult. Moreover, a single ISP only has limitedknowledge about user demand, available network resources, and routingpolicies at the peers.

One of the major problems encountered in inter-domain trafficengineering is appropriately reacting to congestion, losses, and/ordelays experienced by traffic between a home network and other domains.This problem is encountered in a home network with multiple links intoanother domain, such as large ISPs with multiple peering links toanother backbone, or a smaller local network multi-homed to two or morebackbones. In such cases, the company paying for these links may findthat, under normal policies, traffic exiting the home network throughone of the links is experiencing congestion, while other links areunder-utilized. In some cases, the congestion experienced by trafficleaving or entering the home network may actually occur outside itsdomain. In order to improve the performance experienced by theconnections generating the associated traffic, the ISP operator needs tofind a way to alleviate the congestion, losses and/or delays.

In simple cases, where the congestion is on the home network's egresslink, congestion may be detected via Simple Network Management Protocol(SNMP) traffic measurements. SNMP is a set of protocols for managingcomplex networks which was first used in the 1980s. SNMP works bysending messages, called protocol data units (PDUs), to different partsof a network. SNMP-compliant devices, called agents, store data aboutthemselves in Management Information Bases (MIBs) and return this datato the SNMP requesters. A user can determine an alternate route for thetraffic by using SNMP to measure the available capacity on the alternateegress links. Under such circumstances, there are any number ofapproaches to the problem of load balancing the traffic with the onlyreal problem being the way it is implemented because typicalinter-domain routing using Border Gateway Protocol (BGP) is a relativelycoarse mechanism for traffic engineering.

A variety of metrics can also be used when a user is trying to determinethe best path to use for sending traffic over a network. Some routingprotocols, such as Routing Information Protocol (RIP), use only onemetric and that is hop count. And some routing protocols, such asInterior Gateway Routing Protocol (IGRP), use a combination of metrics.The metrics most commonly used are: (1) hop count—the number of routersthat a packet must go through to reach its destination; (2)bandwidth—the data capacity of a link; (3) delay—the length of time tomove the packet from the source to destination; (4) load—the amount ofactivity on a network resource; (5) reliability—the error rate of eachnetwork link; (6) ticks—the delay on a data link using IBM PC clockticks; and (7) cost—an arbitrary value assigned by an administrator. Thebest route depends on the metrics and metric weightings used to make thecalculation. For example, one routing protocol might use the number ofhops and the delay, but might weigh the delay more heavily in thecalculation. Thus, a route having more hops and shorter delays may beless expensive than a route having fewer drops and longer delays. Pathsthat are expensive to use are usually avoided. Such metrics are a usefultool but they cannot accurately account for traffic in other domains.

Leaving aside the practical problems of implementing a method forbalancing the traffic load, there is a more fimdamental problem. In somecases, the congestion experienced by traffic leaving or entering thehome network may actually occur outside the domain so that the operatoris unable to make direct observations of the traffic along differentpaths. Indeed, the operator's knowledge is limited to the home networkand he or she does not necessarily know the routing, topology, orcapacity of the network that the traffic traverses. However, theoperator can often still detect congestion in the network using activeprobes, and the operator may want to take remedial actions to alleviateor avoid this congestion.

If the operator chooses to alleviate or avoid congestion that isdetected on the network, he or she must balance loads across multiplepaths without knowledge of the available capacity and the details of thepaths themselves. The only information available to the operator arecoarse measurements of statistics (such as loss rate) and Round TripTime (RTT), which can be used to infer congestion. In addition, theoperator may be able to use some of the more recently developedtechniques to estimate bottleneck bandwidth, for example, an approachbased on network tomography. These mechanisms are unlikely to give anentirely accurate picture of the paths in question, but can narrow downthe range of possibilities.

A further, and even more difficult problem, is that multiple operatorsmay decide to use similar approaches to avoid congestion and may besimultaneously rerouting their traffic. In this case, they will play akind of “game” against each other to optimize the utilization of theexternal domain, not knowing that other operators are changing trafficpaths. Based on past experience, it has been found that such “games”(where individuals optimize their own utility) do not always lead to theglobal optimal behavior.

Accordingly, there is a need for a method which allows the networkoperator to ease congestion and to balance the loads on the networkpaths. More specifically, there is a need for a method which allows theoperator to estimate the amount of traffic across different paths in anetwork on a fairly accurate basis and to reroute some of the traffic touncongested network paths.

SUMMARY OF THE INVENTION

In accordance with the present invention, a method for balancing trafficacross paths connecting a network to the Internet is provided. Themethod includes forming a connection between a home network and a largenetwork which connects to a plurality of networks. The connection fromthe home network to the larger network includes a plurality of pathscarrying traffic in the form of data packets and each path has a pathload. One of the paths is selected from the plurality of paths whichincludes the selected path and other paths. The selected path has atraffic load and an initial overload. Each path connecting the homenetwork to the larger network has a path load and the selected path hasa traffic load and an initial overload. In a preferred embodiment, thehome network is an internet service provider (ISP) and the largernetwork is the Internet. The method includes measuring the amount oftraffic from the home network to the large network over the selectedpath, the congestion over the selected path and the available capacityover the selected path. The method also includes choosing the path loadfor each of the plurality of paths using a fractional allocationstrategy, wherein the time to generate information is minimized and theamount of traffic lost to overloads is minimized and distributing aportion of the traffic from the selected path to the other paths basedon the fractional allocation strategy.

In a preferred embodiment of the present invention, the fractionalallocation strategy includes the steps of: (a) associating the pathswith a counter i, wherein the counter is a number equal to one (1) andthere are a total ofj paths; (b) calculating the total initial selectedpath overload; (c) calculating the selected path load, wherein the loadis equal to the initial selected path overload less the sum of the lowcapacity boundary for i path(s); (d) calculating the portion of thetraffic on the selected path to be distributed using a bi-sectionalsearch strategy, preferably a multidimensional iterative bisectionsearch algorithm; (e) distributing a portion of the traffic on theselected path to the other paths; and (f) stopping if there are no morepaths (i=j), otherwise increasing the numerical value of the counter byone (1) and go to step (c).

In another preferred embodiment of the present invention, the portion ofthe traffic is distributed to the other paths using the equation$\begin{matrix}{{x_{i} = {l_{i} + {\frac{h_{i} - l_{i}}{\sum\limits_{i = 1}^{P}( {h_{i} - l_{i}} )} \times ( {{x_{0}(0)} - {\sum\limits_{i = 1}^{P}l_{i}}} )}}},} & (1)\end{matrix}$wherein x_(i) is the path load, l_(i) is low capacity boundary, hi ishigh capacity boundary, P is the total number of paths and x₀(0) is theinitial overload on the selected path.

The cost for using a path connecting the home network to the largernetwork is measured using the equation${C = {\sum\limits_{t = 1}^{T}{\sum\limits_{i = 0}^{P}\lbrack {{x_{i}(t)} - {c_{i}(t)}} \rbrack^{+}}}},$wherein C is the cost, T is the time period over which the feasiblesolution is obtained, P is the number of paths between the home networkand the large network, x is path load and c is the capacity of the pathat time t.

In preferred embodiments, the amount of traffic from the home network tothe large network over the selected path is measured using flow levelmeasurements or Simple Network Management Protocol (SNMP). Thecongestion over the selected path is measured using active probes,passive measurements of traffic details Transmission Control Protocol(TCP) Synchronize/Acknowledgement (SYN/ACK) response time or Round TripTime (RTT), and loss measurements. And the available capacity over theselected path is measured using flow level measurements, Simple NetworkManagement Protocol (SNMP) link measurements, Round Trip Time (RTT),loss measurements, active probes, or Transmission Control Protocol (TCP)Synchronize/Acknowledgement (SYN/ACK) response time.

The method of the present invention allows a local network operator,such as an ISP, to balance traffic loads across paths connecting thelocal network to the Internet and minimize congestion and improveefficiency.

BRIEF DESCRIPTION OF THE FIGURES

Other objects and many attendant features of this invention will bereadily appreciated as the invention becomes better understood byreference to the following detailed description when considered inconnection with the accompanying drawings wherein:

FIG. 1 is a graph which illustrates the two-dimensional (2-d) case forthe multidimensional iterative bisection search (MIBS) method.

FIG. 2A is a bar graph showing a comparison of different search schemesbased on convergence time when x₀=10,000 and ω=1.00.

FIG. 2B is a bar graph showing a comparison of different search schemesbased on total cost when x₀=10,000 and ω=1.00.

FIG. 3A is a bar graph showing a comparison of different search schemesbased on convergence time when x₀=10,000 and ω=1.05.

FIG. 3B is a bar graph showing a comparison of different search schemesbased on total cost when x₀=10,000 and ω=1.05.

FIG. 4A is a graph showing a comparison of different search schemes forvarying numbers of paths based on convergence time when x₀=10,000 andω=1.00.

FIG. 4B is a graph showing a comparison of different search schemes forvarying numbers of paths based on total cost when x₀=10,000 and ω=1.00.

FIG. 5A is a graph showing a comparison of different search schemes forvarying numbers of paths based on convergence time when x₀=10,000 andω=1.05.

FIG. 5B is a graph showing a comparison of different search schemes forvarying numbers of paths based on total cost when x₀=10,000 and ω=1.05.

FIG. 6A is a graph showing a comparison of different search schemes forvarying numbers of paths based on convergence time when x₀=19,000 andω=1.20.

FIG. 6B is a graph showing a comparison of different search schemes forvarying numbers of paths based on total cost when x₀=10,000 and ω=1.20.

FIG. 7A is a graph showing a comparison of different search schemes fordifferent prior estimates of the pinning interval, α, based onconvergence time when x₀=10,000 and ω=1.01.

FIG. 7B is a graph showing a comparison of different search schemes fordifferent prior estimates of the pinning interval, α, based on totalcost when x₀=10,000 and ω=1.01.

FIG. 8A is a graph showing a comparison of different search schemes fordifferent available capacity for five (5) alternate paths based onconvergence time when x₀=10,000.

FIG. 8B is a graph showing a comparison of different search schemes fordifferent available capacity for five (5) alternate paths based on totalcost when x₀=10,000.

FIG. 9A is a graph showing a comparison of different search schemes fordifferent available capacity for ten (10) alternate paths based onconvergence time when x₀=10,000.

FIG. 9B is a graph showing a comparison of different search schemes fordifferent available capacity for ten (10) alternate paths based on totalcost when x₀=10,000.

FIG. 10A is a graph showing a comparison of different search schemes fordifferent available capacity for twenty (20) alternate paths based onconvergence time when x₀=10,000.

FIG. 10B is a graph showing a comparison of different search schemes fordifferent available capacity for twenty (20) alternate paths based ontotal cost when x₀=10,000.

FIG. 11A is a graph showing a worst case comparison of different searchschemes for varying numbers of paths based on convergence time whenx₀=10,000 and ω=1.00.

FIG. 11B is a graph showing a worst case comparison of different searchschemes for varying numbers of paths based on total cost when x₀=10,000and ω=1.00.

FIG. 12A is a graph showing a worst case comparison of different searchschemes for varying numbers of paths based on convergence time whenx₀=10,000 and ω=1.10.

FIG. 12B is a graph showing a worst case comparison of different searchschemes for varying numbers of paths based on total cost when x₀=10,000and ω=1.10.

DETAILED DESCRIPTION OF THE INVENTION

The present invention is a method for balancing traffic on a networkwhen the operator does not have complete information about the operatingand performance characteristics of the network. In order to balancenetwork traffic, it is desirable for the home network operator to beable to move at least some traffic to uncongested network paths to easecongestion. For the present application, the term “home network” refersto the network, such as an ISP network, for which the operator isbalancing the traffic entering and exiting.

A major challenge for inter-domain traffic engineering is the level ofuncertainty that the operator of an ISP faces. With many potentiallycompeting ISPs, global coordination can be difficult simply because ofthe size of the network. A single ISP only has limited knowledge aboutuser demand, available network resource, and routing policies at thepeers. In addition, the operator has limited control over how thetraffic is routed external to its domain, the home network. However, theISP operator can determine how (i.e., at what links) the traffic exitsits domain by using various tools such as Border Gateway Protocol (BGP).The present invention utilizes the capability to select home networkexit links and provides methods for traffic load balancing in the casewhere the network service provider possesses limited knowledge aboutuser demand, available network resources, and the routing policies usedon network paths. In particular, the present invention providesalgorithmic techniques for determining how traffic loads are shiftedfrom congested links to uncongested links with unknown availablecapacities.

In order to implement the method of the present invention, three typesof measurements must be available to the home network operator: (1) thetraffic contribution of the home network to any one path; (2) thecongestion on the paths connecting the ISP to the network; and (3) theavailable capacity on a path.

The home network operator must be able to measure the trafficcontribution of the home network to any one path. This can be done bythe home network operator fairly precisely, at whatever level isnecessary, either using flow level measurements, or SNMP linkmeasurements when it is equivalent to path traffic.

The home network operator must be able to measure congestion (inferredfrom Round Trip Time (RTT), and loss measurements) using active probes,or passive measurements of traffic details such as the TransmissionControl Protocol (TCP) Synchronize/Acknowledgement (SYN/ACK) responsetime. (TCP is one of the main protocols in TCP/IP networks. Whereas theIP protocol deals only with packets, TCP enables two hosts to establisha connection and exchange streams of data. TCP guarantees delivery ofdata and also guarantees that packets will be delivered in the sameorder in which they were sent.) Such measurements will, by theirstatistical nature, contain false positives and false negatives whichwill have to be factored into the calculations.

The home network operator must be able to measure the available capacityon a path. These measurement scan be obtained using flow levelmeasurements, SNMP link measurements, RTT, loss measurements, activeprobes, or passive measurements of traffic details, such as theTransmission Control Protocol (TCP) Synchronize/Acknowledgement(SYN/ACK) response time. However, these measurements are less accuratefor measuring path capacity than they are for measuring the trafficcontribution of the home network and congestion. Therefore, they areonly used to obtain approximate bounds on the available capacity of apath. A worst case estimate of the bounds on the capacity of a pathwould be [0,f_(i)], where f_(i) is the capacity of the first hop link,leaving the home network.

The method of the present invention is based on the assumption thatthere are P+1 egress paths from the home network and that 1 iscongested. The paths are indexed by I

{0, 1, . . . , P}. On each path, p_(i), at time t, it is assumed thatthere is available capacity, c_(i)(t), which can vary over time due tochanges in cross-traffic along the path of interest. In general, theavailable capacity, c_(i), is not known, except for some bounds,c_(i)(t)

[l_(i)(t), h_(i)(t)], obtained through measurements. For the methodsdescribed herein, this interval is called the pinning interval, α.

The traffic being sent to a specific path can be directly measured andit is denoted as x_(i) while, without the loss of generality, thecongested path is denoted as path “0.” It is assumed that the capacityover the congested path, c₀, is known precisely—the problem can besimply generalized if not, by including x₀ in the processes below. Undersuch assumptions, a simple transformation allows the problem to beconsidered, where c₀←0 and x₀←x₀−c₀, so that x₀ is now the excess loadthat needs to be distributed in order to alleviate congestion.

The congestion indications are denoted by indicator functions, where${C_{i}(t)} = \{ \begin{matrix}{1,} & {{{if}\quad{path}\quad i\quad{is}\quad{congested}\quad{at}\quad t},} \\{0,} & {{{otherwise}.}\quad}\end{matrix} $

The method of the present invention uses an infinite buffer model inwhich the amount of traffic dropped is modeled as the excess load,d_(i)(t)=[x_(i)(t)−c_(i)(t)]⁺, where [ . . . ]⁺ denotes the positivepart. This condition can be relaxed if the appropriate measurements totake such details into account are available. However, such measurements(for instance packet traces) are rarely available in practice.

The above statement also assumes that the time intervals, t=1, 2, 3, . .. , are sufficiently far apart that transient effects from changes, forinstance in the offered loads (i.e., the traffic sent to a path,x_(i)(t)), need not be considered and that congestion indications areinstantaneous, though in reality they will always involve some delay.Typically, this delay would be measured in seconds, whereas, the timescale of the algorithm of the present invention can be in the order ofminutes. In addition, the individual paths are assumed to be disjointedin the sense that they do not share bottlenecks. Otherwise, the pathssharing the bottleneck could be incorporated into one, larger path.

Statement of the Problem

The method of the present invention provides an iterative algorithm thatuses congestion indication measurements alone to determine a feasiblesolution to a load balancing problem. If a load balancing problem doesnot exist, the method will find the minimal cost solution. A minimalcost solution is found to be feasible if${{x_{i}(t)} \leq {c_{i}(t)}},{{\forall i} = 0},\ldots\quad,P,{\forall{t \geq T}},{{\sum\limits_{i = 1}^{P}{x_{i}(t)}} = x_{0}},$where T is the time at which convergence occurs. The minimum averagelong-term cost solution is the solution that minimizes the averagenumber of packets dropped over time and it can be expressed by thefollowing expression$\overset{\lim}{ Tarrow\infty }\frac{1}{T}{\sum\limits_{t}{\sum\limits_{i = 0}^{P}{\lbrack {{x_{i}(t)} - {c_{i}(t)}} \rbrack^{+}.}}}$

In addition, if there is a feasible solution, it would be desirable forthis solution to satisfy a number of properties. For instance, thesolution should have a short convergence time (the time, T, at which thefeasible solution is obtained should be as short as possible, both inexpectation, and mean) and the solution should drop as few packets aspossible so that it has a low cost, where cost is measured by$C = {\sum\limits_{t = 1}^{T}{\sum\limits_{i = 0}^{P}{\lbrack {{x_{i}(t)} - {c_{i}(t)}} \rbrack^{+}.}}}$

The method does not assume that path traffic equals link traffic on theegress links, so there can be more than one set of paths across whichtraffic can be balanced.

The Static Case

The first case considered is the static case. In this case, it isassumed that: (1) the network, and cross-traffic loads do not changeover time, so that c_(i)(t)=c_(i), where c_(i) is a constant; (2) thetraffic is inelastic, and is not directly affected by changes in itsrouting, or the congestion level; and (3) the congestion indications oneach path are always correct.

In the limit where the discretization size B→0, a feasible solution canalways be found when $x_{0} \leq {\sum\limits_{i = 1}^{p}{c_{i}.}}$As the discretization increases, there may be cases where thisrelationship is satisfied, but there is no feasible solution. In orderfor a feasible solution to exist, a necessary and sufficient conditionis that${x_{o} \leq {\sum\limits_{i = 1}^{p}{\lbrack {c_{i}/B} \rbrack \times B}}},$where [x] denotes the largest integer less than x. If the capacity isdiscretized with the same quantization, B, as the traffic, then the twoconditions are equivalent.

Where there is no feasible solution, the minimum long-term cost solutionis any one for which all of the paths 1, . . . , P are congested. Any ofthese solutions will be equivalent with respect to long-term cost (asdefined above). Since it is relatively easy to find such solutions,these cases are not addressed.

When there is no feasible solution, a different cost metric is used. Forexample, it may be desirable to have congestion on as few paths aspossible to reduce the number of users impacted by congestion, or it maybe more desirable to have moderate levels of congestion on all paths,rather than a single path with high congestion. However, beforealternate approaches are discussed, the solution for the static case isfound.

Under the conditions for the static case, the bounds of the capacityestimates can be tightened after each time interval. For examplel _(i)(t+1)=max{l _(i)(t), x _(i)(t)}, C _(i)(t)=0,h _(i)(t+1)=min{h _(i)(t), x _(i)(t)}, C _(i)(t)=1.If the path is congested, then the available capacity on that path isless than the currently used capacity, whereas, if the path isuncongested, the available capacity is at least what is currently beingused. In the discrete case, slightly better bounds can be obtained bytakingh _(i)(t+1)=min{h _(i)(t), x _(i)(t)−B}, C _(i)(t)=1,because x_(i) can only be changed in multiples of B.

The problem is to choose the traffic sent to a path (also referred toherein as traffic path load or path load), x_(i)(t), at each timeinterval, t, so that the total load to be distributed, x₀(0), is spreadacross paths 0, . . . , P, and converges (with as low a cost aspossible) to a feasible solution, one in which x_(i)(t)≦c_(i)(t), ∀_(i).By choosing x_(i)(t+1)

(l_(i)(t+1), h_(i)(t+1)), it is possible to generate additionalinformation that can be used in the next iteration to refine the boundsfor the possible capacities, c_(i). Either the low capacity boundary,l_(i), can be increased, or the high capacity boundary, h_(i), can bedecreased.

The problem is how best to choose the traffic load sent to a path,x_(i), so as to generate information as quickly as possible, while atthe same time reducing the probability that traffic will be lost tooverloads. Several different strategies can be used to approach thisproblem and they are described below. In each case, the algorithm isterminated at the point when either a feasible solution is reached, orall links are congested. Convergence is guaranteed by the fact that theallowable region [l_(i), h_(i)] will decrease at each time step as longas at least one x_(i)

(l_(i), h_(i)) is chosen.

Proportional Allocation

One embodiment of the present invention approaches the problem ofchoosing the traffic load sent to a particular path by trying tominimize the expected losses at each iteration of the algorithm. Givenno other information, the simplest assumption that can be made is thatthe available capacity on each path is uniformly distributed between thebounds, that is, c_(i)˜U[l_(i)(t), h_(i)(t)], at some time, t. In fact,each measurement may be considered to provide more information thansimply the bounds. Therefore, the most reasonable distribution to usemight be something other than the uniform distribution. A preferredembodiment of the present invention uses the following allocation ofx_(i) $\begin{matrix}{{x_{i} = {l_{i} + {\frac{h_{i} - l_{i}}{\sum\limits_{i = 1}^{P}( {h_{i} - l_{i}} )} \times ( {{x_{0}(0)} - {\sum\limits_{i = 1}^{P}l_{i}}} )}}},} & (1)\end{matrix}$

This method is referred to in the present invention as “proportionalallocation.” The reason for selecting the name can be seen in the simplecase where l_(i)=0, ∀_(i). In this case, the allocations to each pathwill be proportional to the high boundary, h_(i), for that path.

In the discrete case, the path loads, x_(i), must be multiples of theblock size B. The algorithm can take several forms, but a simpleapproach is to take the x_(i)=B×└x_(i)/B┘, and then to take theremaining load, and distribute it in a round-robin fashion between thepaths.

The final algorithm then proceeds in a series of steps. The initialalgorithm uses coarse bandwidth estimates for the low, l_(i)(0), andhigh, h_(i)(0) boundaries, and then proceeds by choosing the path load,x_(i), as in equation (1). Once congestion has been measured on eachpath, the bounds are updated, and the steps are repeated, until either afeasible solution is reached, or all paths are congested. Although thisapproach minimizes the expected cost at each step, it does not minimizethe overall cost. Such a minimization also needs to reduce the boundaryintervals [l_(i), h_(i)] as quickly as possible.

Fractional Allocation Strategies

In the proportional allocation method discussed above, it was assumedthat the initial path load was zero, x₀=0, in order to minimize the costat each step. Otherwise, λ=−1, and hence x_(i)=h_(i), which yields avalue for x₀ that is not feasible.

In another embodiment of the present invention called the “FractionalAllocation Strategy,” an alternative method is used which does notassume x₀(t)=0, ∀t>0, but instead assigns only part of the initial pathload, x₀, to the other paths at each step. Fractional AllocationStrategy corresponds to the strategy where the initial overload on path“0” (x₀(0)) or the “selected path” is broken up into smaller segmentsand once a starting segment size (to be off-loaded and distributed tothe other paths connecting the home network to the larger network) hasbeen decided, the subsequent segment sizes are decided based on somerule (e.g., linear, bisection etc.). However, the segments bandwidth canbe distributed amongst users according to any criteria, withProportional Allocation used as the default rule. Algorithm 1, which isshown below, describes the Fractional Allocation scheme. Algorithm 1 --Fractional Allocation Strategy while !(Feasible Allocation) do UpdateLoad as${{Load}(t)} = {{x_{0}(0)} - {\sum\limits_{i = 1}^{P}{l_{i}(t)}}}$Calculate traffic y(t) to be distributed y(t) ≦ Load(t) according tolinear search, bi-section search or exponential search strategyDistribute y(t) amongst paths using equation (1)$( {{with}\quad( {{x_{0}(0)} - {\sum\limits_{i = 1}^{P}{li}}} )\quad{replaced}\quad{by}\quad y\quad(t)} )$Update pinning intervals [l_(i) (t + 1), h_(i) (t + 1)] end while

Three strategies can be used for choosing the traffic to be distributed(y(t)): (1) linear search, (2) bi-section search, and (3) exponentialsearch.

The “linear search” strategy has a parameter STEP_SIZE which is chosenby setting a number of rounds, N, that it will take for the algorithm toconverge. Hence,${STEP\_ SIZE} = {\frac{{x_{0}(0)} - {\sum\limits_{i = 1}^{P}{l_{i}(0)}}}{N}.}$At each iteration, the traffic to be distributed, y(t)=min{STEP_SIZE,Load(t)} of the load is assigned to the paths i=1, . . . , P.

As the algorithm progresses, the traffic being reallocated at each step,y(t), tends to be allocated to uncongested links, and so the lowerbounds, l_(i), on uncongested paths will increase, thereby decreasingthe load on path “0” by approximately the STEP_SIZE at each interval,and converging in after about N steps. Essentially, the aim of thisapproach is to minimize the congestion on paths, i=1, . . . , P.However, this strategy allows ongoing congestion on path “0” to persistfor longer periods of time. In some cases, it may be more desirable toallocate traffic this way, for example, when it is necessary to reducethe overall number of dropped packets and to keep the number ofcongested paths to a minimum at all times.

The “bi-section search” strategy is the preferred strategy of thepresent invention. The “bi-section search” strategy takes a parameter0<α<1, which would typically be 0.5 (thus, giving the search strategyits name) and then performs the algorithm defined by choosing thetraffic to be distributed, y(t), that satisfy${y(t)} = {\min\{ {{\alpha\lbrack {\sum\limits_{i = 1}^{P}( {{h_{i}(t)} - {l_{i}(t)}} )} \rbrack},{{Load}(t)}} \}}$This approach is intended to allocated the traffic more quickly at thestart, and then slows down as the bounds become more narrow.Accordingly, this approach would be expected to converge more quicklythan the linear search.

An alternative is the “exponential search” strategy, which is defined bythree parameters: EXP_BASE (chosen to be either constant, or a constantfraction of Load), EXP_FACTOR, which is (by default) 2, and ROUND_ID.The traffic to be distributed, y(t), are chosen usingy(t)=min{EXP_BASE×EXP_FACTOR^(ROUND) ^(—) ^(ID), Load(t)}Multidimensional Iterative Bisection Search

In a preferred embodiment of the present invention, a “multidimensionaliterative bisection search” (MIBS) is used wherein the entire initialcongested load from path “0” is allocated at each step. This approach isanalogous to a traditional binary search, though it is generalized to ahigher dimensional, constrained space.

To illustrate the MIBS approach, the simplest, non-trivial case, isconsidered, where there are P=2 alternate paths across the networkboundary which can be used to balance the excess load. The graph in FIG.1 illustrates this two-dimensional (2-d) case. The graph shows thecapacity constraints, x₁<c₁ and x₂<c₂, (vertical and horizontal dottedlines, respectively), and the constraint, x₁+×₂=x₀, is plotted as adashed line (the constraint line). The feasible solutions fall on thesmall solid segment of the constraint line.

The multidimensional iterative bisection search approach can start withzero knowledge of the bounds (i.e., effectively it is assumed that thelow and high boundaries equal zero (l_(i)=0 and h_(i)=x₀(0)) for all i.When better information is obtained about the initial pinninginterval(α), it can be easily exploited. The approach is initialized byusing the proportional allocations, which in this (zero knowledge) caseassigns equal traffic to paths 1 and 2. The approach proceeds much as atypically binary search would proceed. A new set of pinning intervals isselected based on the constraints that are satisfied. The pinninginterval, α, (along the constraint line) is then bisected and testedagain. This is shown by the series of dots in FIG. 1.

The great advantage of the local search approach is that it provablyconverges to the feasible region very quickly. Its worst caseperformance (for the 2-d case) is T=log₂(1/L), where L is the proportionof the constraint line (in the positive quadrant of IR²), that isfeasible. In cases where the solution must be discretized, the smallestfeasible region (if one exists) is L˜B/{square root}{square root over(2x₀(0))}, and so the convergence time will increase for larger initialloads, and decrease for larger discretizations (assuming the availablecapacities are scaled appropriately).

In fact, for the two-dimensional case, if the total available capacity,${c = {\sum\limits_{i = 1}^{P}c_{i}}},$

is no more than the load to be distributed, the multidimensionaliterative bisection search is provably optimal. This can be shown by thecase where c=x₀(0). In this case, a standard binary search with itsnormal properties is simply being performed. Algorithm 2 -Multidimensional Iterative Bisection Search NUM_ITERATIONS: Number ofLocal Iterations while! (Feasible Allocation) do Allocate RatesProportionally amongst all paths i = 0 while (I + + < NUM_iterations) doShuffle the list of paths. while (! End of List of Paths) do Pick anunpicked pair from the List, (Let paths i, j be picked.) s_(ij) =x_(i) + x_(j) LO_(i) = max(l_(i), s_(ij) − h_(j)) HI_(i) = min(h_(i),s_(ij) − l_(j)) LO_(j) = s_(ij) − HI_(i) HI_(j) = s_(ij) − LO_(i) x_(i)= LO_(i) + (HI_(i) − LO_(i))/2 x_(j) = s_(ij) − x_(i) end while endwhile end while

The disadvantage of the two-dimensional approach is that it is slightlymore complicated to generalize the algorithm to higher dimensions. Inhigher dimensions, it is necessary to run the equivalent of the binarysearch on a P-1 dimensional manifold embedded in the positive quadrantof R^(P). The algorithm discussed in the next section is a goodapproximation to this search.

Modified Multidimensional Iterative Bisection Search

Another embodiment of the present invention, is a modification of MIBS.In the two-dimensional case, MIBS is provably optimal if c≦x₀(0).However, it is no longer optimal if c>x₀(0). The modifiedmultidimensional iterative bisection search (MMIBS) shown below isprovably optimal (in the 2-d case) when the total capacity is given. Itis also provably better than MIBS when a true lower bound of the totalcapacity is known. When c≦x₀(0), MMIBS behaves exactly as the MIBSalgorithm. Algorithm 3 - Modified Multidimensional Iterative BisectionSearch if (know true TotalCapacity) then estCapacity = TotalCapacityelse estCapacity = max(totalDemand, capacityLowerBound) end if extra ifextraCapacity = estCapacity − totalDemand Initialize l_(i) = 0 and h₁ =estCapacity NUM-ITERATIONS: Number of Local Iterations while ! (FeasibleAllocation) do Allocate Rates Proportionally amongst all paths. i = 0while (!++<NUM_ITERATIONS)do Shuffle the list of paths. while (!End ofList of Paths) do Pick an unpicked pair from the List, (Let paths i,jare picked.) S_(ij) = x_(i) + x_(j) S = s_(ij) + extraCapacity LO_(i) =max(l_(i),S − h_(j)) HI_(i) = min(h_(i),S − l_(j)) LO_(j) = S − HI_(i)HI_(j) = S − LO_(i) x_(i) = LO_(i) + (HI_(i) − LO_(i) − extraCapacity)/2x_(i) = max(min(x_(i),s_(ij)),0) x_(j) = s_(ij) − x_(i) end while endwhile end while

EXAMPLES

The examples set forth below serve to provide further appreciation ofthe invention but are not meant in any way to restrict the scope of theinvention.

The examples illustrate the performance of the different approaches forthe static case. The results were generated using simulations in whichthe true available capacities on paths i=0, . . . , P are chosenrandomly by choosing a set of random variable z_(i)˜U(0,1), (that is,uniformly on the interval (0,1)), and then setting the capacities to be${c_{i} = {\omega \times \frac{z_{i}{x_{0}(0)}}{\sum\limits_{i = 1}^{P}z_{i}}}},$where the number of available paths, ω is a tunable parameter in orderto allow the size of the feasible region to be varied. In the continuouscase, when the number of available paths equals one (ω=1), there isexactly enough capacity to allow all of the traffic to be carried; whenthe number of available paths is greater than one (ω≧1) a feasiblesolution exists, and when the number of available paths is less than one(ω<1), no feasible solution exists. In the discrete case, the capacity,c_(i), is also discretized, so that the number of available paths (ω) nolonger has the same strict relationship to the feasibility of thesolution.

Where not otherwise specified, initial estimates are generated forl_(i)=0 and h_(i) by a random variable distributed between the maximumcapacity of any path, and the load to be distributed, for exampleh _(i) =h˜U(c _(max) ,x ₀(0)), ∀i=1, . . . , P.

Results for different allocation strategies are presented as the numberof available paths (ω) and the accuracy of initial measurements ofpinning intervals (α) vary. The results are also presented for the worstcase performance of these allocation strategies. The performance metricsof interest are convergence time and total cost incurred. The resultspresented in the examples show the convergence times and total costsover 1,000 simulations.

In all of the results, the discretized version is used, with B=1, andthe initial load x₀(0) is varied rather than B. An equivalent approachwould be to fix x₀(0), and vary 1/B in the same way.

Example 1 Comparison of Various Allocation Strategies

This example compares the different allocations strategies: proportionalallocation, linear-search, bisection search, exponential search andmultidimensional iterative bisection search (MIBS). FIGS. 2A-B and FIGS.3A-B show comparisons of these strategies, through both the convergencetime and the total cost. The graphs show box plots giving the mean,quartiles, and extreme values of the 1,000 simulations, for a range ofvalues for the number of available paths (ω). The results clearly showthat the linear and exponential approaches are much worse than anyalternative. This is because these schemes are conservative in theirapproach, i.e., they do not cause congestion on any uncongested path,and thus incur high cost by not easing the congestion on the congestedpath.

For a small number of available paths (ω)), the bisection approach isalso significantly worse than the proportional allocation and localsearch approach, which have very similar results. Given the additionalcomplexity of the bisection approach, over straight proportionalallocation, it does not provide any advantages. Also, the performance ofMIBS is the same or marginally better than that of proportionalallocation. Therefore, the MIBS and proportional allocation strategiesare preferred.

Example 2 Comparisons of Different Allocation Strategies for VariableNumber of Alternate Paths

This example studies the proportional allocation and themultidimensional iterative bisection search (MIBS) approaches. In thisexample, the total number of alternate paths that are available variedfrom 2 to 20 for a constant path load, x₀(0)=10,000. The number ofavailable paths, ω, also varied from 1.0 to 1.2. FIGS. 4 to 6 show theresults—giving the average values as points, and the 95 percentiles ofthe distribution of results as vertical bars. The results indicate, bothalgorithms have similar performance vis-a-vis convergence time. However,MIBS outperforms the other strategies when the total cost is consideredas a performance metric, particularly for large numbers of paths. Also,it is interesting to note that the cost and the convergence timesincrease as the total number of available paths increases, except forthe MIBS with respect to cost, which reaches a peak, and then staysthere.

Example 3 Accurate Measurements of the Pinning Interval

This example compares the approaches to allocation when more accuratemeasurements for the initial pinning interval, α, are available. In thesimulations, this was tested by drawing the lower and higher thresholdsof the pinning interval asl_(i)(0)

U[(1−α)c_(i) c_(i)] andh_(i)(0)

U[c_(i) c_(i)(1−α)]for different values of α. Specifically, the pinning interval, α, wasvaried from 0.05 to 1. FIG. 7 shows the results. The proportionalallocation strategy converges faster than MIBS though the difference inconvergence times is less than 1 iteration. Also, the results suggestthat convergence times as well as total costs decrease as the value ofthe pinning interval, α, decreases. This is intutitive as a decreasingvalue of pinning intervals, α, suggests that we have more accurateestimates of the pinning interval, α, especially the lower bounds.

Example 4 Performance of Different Allocation Strategies for DifferentValues of Available Capacity

This example presents the results for the performance of theproportional and multidimensional iterative bisection search (MIBS)strategies for different values of available capacities. For all theexperiments, the total initial load, x₀(0), was maintained at a constantvalue of 10,000 and the total available capacity was varied from 10,000to 12,500 by varying the number of available paths, ω. The total numberof paths available was also varied from 2 to 20. However, FIGS. 8A-B,9A-B and 10A-B show the results for only 5, 10 and 20 paths. Thesefigures show that, as the available capacity increases, the convergencetime and the total cost incurred decreases. This is because greateravailable capacity implies a bigger feasible region and, thus, it iseasier and faster to find a feasible solution. FIGS. 8 to 10 also showthat, as the available capacity increases, MIBS outperforms theproportional scheme. Moreover, the gains are more significant if thenumber of available paths is larger.

Example 5 Focused Capacity Performance of Different AllocationStrategies

In this example, the performance of the three schemes was compared whenthe available capacity is focused on one alternative path. Theconditions for the tests included a constant initial load (x₀(0)) and agiven number of available paths (P), with a total available capacityc=x₀(0), P-1 paths with capacity c_(i)=1, and one path with c_(i)=c−n+1units of available capacity. FIGS. 11A-B and 12A-B show the results ofthese tests.

Thus, while there have been described the preferred embodiments of thepresent invention, those skilled in the art will realize that otherembodiments can be made without departing from the spirit of theinvention, and it is intended to include all such further modificationsand changes as come within the true scope of the claims set forthherein.

1. A method for balancing traffic across paths connecting a network tothe Internet comprising: forming a connection between a home network anda large network which connects to a plurality of networks, wherein saidconnection comprises a plurality of paths carrying traffic in the formof data packets between the home network and said large network, andwherein each path has a path load; selecting one of said plurality ofpaths, wherein said plurality of paths comprises said selected path andother paths, and wherein said selected path has a traffic load and aninitial overload; measuring the amount of traffic from the home networkto the large network over the selected path; measuring the congestionover the selected path; measuring the available capacity over theselected path; choosing the path load for each of said plurality ofpaths using a fractional allocation strategy, wherein the time togenerate information is minimized and the amount of traffic lost tooverloads is minimized; and distributing a portion of the traffic fromthe selected path to the other paths.
 2. The method for balancingtraffic across paths connecting a network to the Internet according toclaim 1, wherein the fractional allocation strategy comprises: (a)associating the paths with a counter i, wherein the counter is a numberequal to one (1) and there are a total of j paths; (b) calculating thetotal initial selected path overload; (c) calculating the selected pathload, wherein the load is equal to the initial selected path overloadless the sum of the low capacity boundary for i path(s); (d) calculatingthe portion of the traffic on the selected path to be distributed usinga bi-sectional search strategy; (e) distributing a portion of thetraffic on the selected path to the other paths; and (f) stopping ifthere are no more paths (i=j), otherwise increasing the numerical valueof the counter by one (1) and go to step (c).
 3. The method forbalancing traffic across paths connecting a network to the Internetaccording to claim 1, wherein the portion of the traffic is distributedto the other paths using the equation $\begin{matrix}{{x_{i} = {l_{i} + {\frac{h_{i} - l_{i}}{\sum\limits_{i = 1}^{P}( {h_{i} - l_{i}} )} \times ( {{x_{0}(0)} - {\sum\limits_{i = 1}^{P}l_{i}}} )}}},} & (1)\end{matrix}$ wherein x_(i) is the path load, l_(i) is low capacityboundary, h_(i) is high capacity boundary, P is the total number ofpaths and x₀(0) is the initial overload on the selected path.
 4. Themethod for balancing traffic across paths connecting a network to theInternet according to claim 2, wherein the portion of the traffic isdistributed to the other paths using the equation $\begin{matrix}{{x_{i} = {l_{i} + {\frac{h_{i} - l_{i}}{\sum\limits_{i = 1}^{P}( {h_{i} - l_{i}} )} \times ( {{x_{0}(0)} - {\sum\limits_{i = 1}^{P}l_{i}}} )}}},} & (1)\end{matrix}$ wherein x_(i) is the path load, l_(i) is low capacityboundary, h_(i) is high capacity boundary, P is the total number ofpaths and x₀(0) is the initial overload on the selected path.
 5. Themethod for balancing traffic across paths connecting a network to theInternet according to claim 2, wherein the bi-sectional search strategyuses a multidimensional iterative bisection search algorithm.
 6. Themethod for balancing traffic across paths connecting a network to theInternet according to claim 1, wherein the cost is measured using theequation${C = {\sum\limits_{t = 1}^{T}{\sum\limits_{i = 0}^{P}\lbrack {{x_{i}(t)} - {c_{i}(t)}} \rbrack^{+}}}},$and wherein C is the cost, T is the time period over which the feasiblesolution is obtained, P is the number of paths between the home networkand the large network, x is path load and c is the capacity of the pathat time t.
 7. The method for balancing traffic across paths connecting anetwork to the Internet according to claim 1, wherein the amount oftraffic from the home network to the large network over the selectedpath is measured using flow level measurements or Simple NetworkManagement Protocol (SNMP).
 8. The method for balancing traffic acrosspaths connecting a network to the Internet according to claim 1, whereinthe congestion over the selected path is measured using active probes,or passive measurements of traffic details.
 9. The method for balancingtraffic across paths connecting a network to the Internet according toclaim 1, wherein the congestion over the selected path is measured usingTransmission Control Protocol (TCP) Synchronize/Acknowledgement(SYN/ACK) response time.
 10. The method for balancing traffic acrosspaths connecting a network to the Internet according to claim 1, whereinthe congestion over the selected path is measured using Round Trip Time(RTT), and loss measurements.
 11. The method for balancing trafficacross paths connecting a network to the Internet according to claim 1,wherein the available capacity over the selected path is measured usingflow level measurements, Simple Network Management Protocol (SNMP) linkmeasurements, Round Trip Time (RTT), loss measurements, active probes,or Transmission Control Protocol (TCP) Synchronize/Acknowledgement(SYN/ACK) response time.